Block clustering with collapsed latent block models

نویسندگان

  • Jason Wyse
  • Nial Friel
چکیده

We introduce a Bayesian extension of the latent block model for model-based block clustering of data matrices. Our approach considers a block model where block parameters may be integrated out. The result is a posterior defined over the number of clusters in rows and columns and cluster memberships. The number of row and column clusters need not be known in advance as these are sampled along with cluster memberhips using Markov chain Monte Carlo. This differs from existing work on latent block models, where the number of clusters is assumed known or is chosen using some information criteria. We analyze both simulated and real data to validate the technique.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Completely random measures for modelling block-structured sparse networks

Statistical methods for network data often parameterize the edge-probability by attributing latent traits such as block structure to the vertices and assume exchangeability in the sense of the Aldous-Hoover representation theorem. These assumptions are however incompatible with traits found in real-world networks such as a power-law degree-distribution. Recently, Caron & Fox (2014) proposed the...

متن کامل

Integrated Classification Likelihood for Model selection in Block Clustering

Block clustering (or co-clustering or simultaneous clustering) aims at simultaneously partitioning the rows and columns of a data table to reveal homogeneous block structures. This structure can stem from the latent block model which provides a probabilistic modelling of data tables whose blocks arise from row and column clusters. For continuous data, each table entry is typically assumed to fo...

متن کامل

Nanomedicine for tuberculosis: Insights from animal models

Patient noncompliance to current tuberculosis (TB) therapy owing to multidrug administration daily leads to treatment failure and emergence of multidrug resistant and extensively drug resistant TB. To avoid the daily dosing, application of nanotechnology is the only viable solution by virtue of sustained release of drugs. Other potential advantages of the system include the possibility of selec...

متن کامل

Collapsed Blocks Approximations

The collapsed blocks approximations (CBAs) are a class of approximations to multivariate distributions based on full conditional distributions for blocks of variables. The CBAs are useful when the full conditional distributions are known and computationally cheap to sample from and evaluate (i.e. for Markov models). In addition knowledge about the dependence structure is needed (e.g. spatial mo...

متن کامل

Nanomedicine for tuberculosis: Insights from animal models

Patient noncompliance to current tuberculosis (TB) therapy owing to multidrug administration daily leads to treatment failure and emergence of multidrug resistant and extensively drug resistant TB. To avoid the daily dosing, application of nanotechnology is the only viable solution by virtue of sustained release of drugs. Other potential advantages of the system include the possibility of selec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Statistics and Computing

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2012